Towards a Software-Hardware Co-Designed Resilient System

Authors

  • Man-Lap Li
  • Pradeep Ramachandran
  • Sarita V. Adve
  • Vikram S. Adve
  • Yuanyuan Zhou
Abstract

With continued CMOS scaling, future shipped hardware will be increasingly vulnerable to in-the-field faults. To be broadly deployable, the hardware reliability solution must incur low overheads, precluding use of excessive redundancy. We explore a co-designed hardware-software solution that treats most hardware faults as software bugs and leverages common mechanisms for hardware and software reliability, thereby amortizing some of the overhead. Fundamental to such a solution is a characterization of how hardware faults in different microarchitectural structures of a modern processor propagate through the application and OS. In this paper, we first summarize such a characterization for permanent faults. Motivated by this characterization, we discuss our software-hardware co-designed approach for detecting, diagnosing, recovering from, and repairing/reconfiguring around hardware errors.

Similar resources

Understanding the Propagation of Hard Errors to Software and its Implications for Resilient System Design

With continued CMOS scaling, future shipped hardware will be increasingly vulnerable to in-the-field faults. To be broadly deployable, the hardware reliability solution must incur low overheads, precluding use of expensive redundancy. We explore a co-designed hardware-software solution that treats most hardware faults as software bugs and leverages common mechanisms for hardware and software re...

Design Issues in Hardware/Software Co-Design

The complexity of designing electronic systems and products is constantly increasing. This is due to factors such as portability, the increased complexity of software and hardware, and low-power and high-speed applications. As a result, electronic system design is moving towards System on Chip (SoC) with heterogeneous components like DSP, FPGA etc. This c...

An Automata-Theoretic Approach to Hardware/Software Co-verification

In this paper, we present an automata-theoretic approach to Hardware/Software (HW/SW) co-verification. We designed a co-specification framework describing HW/SW systems; synthesized a hybrid Büchi Automaton Pushdown System model for co-verification, namely Büchi Pushdown System (BPDS), from the co-specification; and built a software tool for deciding reachability of BPDS models. Using our appro...

Performance and Analysis of Low Power Error Resilient Multi Input Multi Output Detectors

Multiple-antenna (MIMO) technology is becoming mature for wireless communications and has been incorporated into wireless broadband standards like LTE and Wi-Fi, but all of these detectors face a power consumption problem. The proposed error-resilient K-best MIMO detector system, designed using a Euclidean bi-orthogonal architecture for the 4 × 4 64-QAM system, achieves better power consump...

Resilient Project Management, A New Approach to Develop Project Management Knowledge (Case Study: Infrastructure Civil Projects Management)

Acceptance of the fact that the working context of civil projects is challenging can enhance resilience capacity and increase project management's focus on improving and developing software and hardware capabilities to facilitate project success. This article is based on research results in the context of macro-hydropower plant project management to ...

Publication date: 2007